AITopics

Country: North America > United States (0.67)

Genre: Research Report > New Finding (0.46)

Industry:

Information Technology (0.67)
Transportation (0.46)
Semiconductors & Electronics (0.45)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Neural Information Processing SystemsFeb-18-2026, 14:19:17 GMT

Probing the Decision Boundaries of In-context Learning in Large Language Models

Recent language models, such as GPT -3+ [Brown et al., 2020, Achiam et al., 2023], have demonstrated Recent attempts to understand in-context learning have focused on various aspects. On the practical side, research has investigated the impact of different factors on in-context learning.

decision boundary, large language model, machine learning, (17 more...)

Country: North America > United States > California > Los Angeles County > Los Angeles (0.14)

Genre: Research Report > Experimental Study (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.91)

Neural Information Processing SystemsFeb-17-2026, 05:03:43 GMT

cf04d01a0e76f8b13095349d9caca033-Supplemental-Conference.pdf

artificial intelligence, deep learning, machine learning, (15 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

Neural Information Processing SystemsFeb-10-2026, 10:00:22 GMT

39d02e8e23bafadd7cd405f2281bc05c-Supplemental-Datasets_and_Benchmarks.pdf

benchmark, cultivar, dataset, (13 more...)

Industry:

Government (0.71)
Health & Medicine > Pharmaceuticals & Biotechnology (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Neural Information Processing SystemsFeb-9-2026, 23:28:16 GMT

b6b90237b3ebd1e462a5d11dbc5c4dae-Paper.pdf

arxiv preprint arxiv, kernel, neural network, (10 more...)

Country:

North America > United States > Pennsylvania (0.04)
North America > Canada (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Neural Information Processing SystemsFeb-8-2026, 17:35:44 GMT

A plug-and-play Transformer module for task-agnostic reasoning

While most existing approaches (e.g., prompt engineering) focus on the LLM's learned representations to patch this performance gap, our experiments actually reveal that LLM representations contain sufficient information to make good

large language model, machine learning, natural language, (16 more...)

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > Oregon > Multnomah County > Portland (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
(3 more...)

Genre: Research Report > New Finding (0.46)

Industry: Information Technology (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Salgado, Henry, Kendall, Meagan R., Ceberio, Martine

Does the Model Say What the Data Says? A Simple Heuristic for Model Data Alignment

arXiv.org Artificial IntelligenceDec-9-2025

In this work, we propose a simple and computationally efficient framework for evaluating whether machine learning models align with the structure of the data they learn from; that is, whether the model says what the data says. Unlike existing interpretability methods that focus exclusively on explaining model behavior, our approach establishes a baseline derived directly from the data itself. Drawing inspiration from Rubin's Potential Outcomes Framework, we quantify how strongly each feature separates the two outcome groups in a binary classification task, moving beyond traditional descriptive statistics to estimate each feature's effect on the outcome. By comparing these data-derived feature rankings with model-based explanations, we provide practitioners with an interpretable and model-agnostic method for assessing model-data alignment.

alignment, artificial intelligence, machine learning, (16 more...)

2511.21931

Country:

North America > United States > Texas (0.15)
Europe > Austria > Vienna (0.14)

Genre: Research Report (0.64)

Industry:

Health & Medicine > Therapeutic Area (0.98)
Health & Medicine > Diagnostic Medicine (0.69)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

arXiv.org Artificial IntelligenceDec-5-2025

LLMs Know More Than Words: A Genre Study with Syntax, Metaphor & Phonetics

Shi, Weiye, Zhang, Zhaowei, Yan, Shaoheng, Yang, Yaodong

Large language models (LLMs) demonstrate remarkable potential across diverse language-related tasks, yet whether they capture deeper linguistic properties--such as syntactic structure, phonetic cues, and metrical patterns--from raw text remains unclear. To analysis whether LLMs can learn these features effectively and apply them to important nature language related tasks, we introduce a novel multilingual genre classification dataset derived from Project Gutenberg, a large-scale digital library offering free access to thousands of public domain literary works, comprising thousands of sentences per binary task (poetry vs. novel; drama vs. poetry; drama vs. novel) in six languages (English, French, German, Italian, Spanish, and Portuguese). We augment each with three explicit linguistic feature sets (syntactic tree structures, metaphor counts, and phonetic metrics) to evaluate their impact on classification performance. Experiments demonstrate that although LLM classifiers can learn latent linguistic structures either from raw text or from explicitly provided features, different features contribute unevenly across tasks, which underscores the importance of incorporating more complex linguistic signals during model training.

artificial intelligence, large language model, natural language, (15 more...)

2512.04957

Country: Europe (0.28)

Genre: Research Report > New Finding (0.68)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Dillavou, Sam, Rocks, Jason W, Wycoff, Jacob F, Liu, Andrea J, Durian, Douglas J

Analog Physical Systems Can Exhibit Double Descent

arXiv.org Artificial IntelligenceNov-25-2025

An important component of the success of large AI models is double descent, in which networks avoid overfitting as they grow relative to the amount of training data, instead improving their performance on unseen data. Here we demonstrate double descent in a decentralized analog network of self-adjusting resistive elements. This system trains itself and performs tasks without a digital processor, offering potential gains in energy efficiency and speed -- but must endure component non-idealities. We find that standard training fails to yield double descent, but a modified protocol that accommodates this inherent imperfection succeeds. Our findings show that analog physical systems, if appropriately trained, can exhibit behaviors underlying the success of digital AI. Further, they suggest that biological systems might similarly benefit from over-parameterization.

artificial intelligence, datapoint, machine learning, (16 more...)

2511.17825

Country: North America > United States > Pennsylvania (0.28)

Genre: Research Report > New Finding (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Jiang, Yuhan, Otten, Matthew

Benchmarking Quantum Kernels Across Diverse and Complex Data

arXiv.org Artificial IntelligenceNov-17-2025

Quantum kernel methods have shown promise and are gaining growing use among quantum machine learning approaches to enhance the performance of kernel-based models, where support vector machines (SVMs) are a common example [1]. They have been applied to various machine learning tasks, such as classification of medical data or high-energy physics [2, 3]. An advanced enhancement to these kernel methods is the trainable quantum kernel, which employs a parameterized quantum circuit (PQC), often referred to as an ansatz. Here, a quantum circuit's gate operations are controlled by a set of externally optimized classical parameters [4, 5]. This enables the quantum kernel to be trained and adapted to the specific structure of a dataset [6]. However, despite theoretical promise, the practical deployment of quantum kernel methods is still in its very early stages. Many research studies focus on a single specific machine learning area with a few dataset samples, but an evaluation of the performance of a quantum kernel across diverse domains remains unverified, whereas this ability is common in classical kernel methods such as the linear kernel or Radial Basis Function (RBF) kernel [7]. This makes it difficult to understand the characteristics of the methods' performance from a comprehensive perspective. Furthermore, existing practice is primarily conducted on low-dimensional synthetic or introductory datasets like variants of MNIST or Iris, or aggressively reduced real-world data that goes from hundreds or more to around ten features [8-10], leaving a large gap in its application to real-world machine learning scenarios.

artificial intelligence, kernel, machine learning, (17 more...)

2511.10831

Country: North America > United States > Wisconsin > Dane County > Madison (0.14)

Genre: Research Report > New Finding (0.93)

Industry: Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.68)